stances, the results of the second search dominate the forwarding decision, and in
still other instances, the default information dominates.



[Figure: block diagram of the multi-layer network element, showing the processor (32),
processor memory, switching element (36), forwarding memory, and packet buffer memory.]









WO 99/00948 



A SYSTEM AND METHOD FOR 
A MULTI-LAYER NETWORK ELEMENT 



PCT/US98/13016 



FIELD OF THE INVENTION 
The present invention relates in general to packet forwarding within a network
and, in particular, to a system and method for forwarding packets using multi-layer
information.

BACKGROUND OF THE INVENTION 
Communication between computers has become an important aspect of
everyday life in both private and business environments. Networks provide a medium
for this communication and further for communication between various types of
elements connected to the network such as servers, personal computers, workstations,
memory storage systems, or any other component capable of receiving or transmitting
data to or from the network. The elements communicate with each other using defined
protocols that define the orderly transmission and receipt of information. In general,
the elements view the network as a cloud to which they are attached and for the most
part do not need to know the details of the network architecture such as how the
network operates or how it is implemented. Ideally, any network architecture should
support a wide range of applications and allow a wide range of underlying
technologies. The network architecture should also work well for very large networks,
be efficient for small networks, and adapt to changing network conditions.

Networks can generally be differentiated based on their size. At the lower
end, a local area network (LAN) describes a network having characteristics including
multiple systems attached to a shared medium, high total bandwidth, low delay, low
error rates, broadcast capability, limited geography, and a limited number of stations.
LANs are generally not subject to post, telegraph, and telephone regulation. At the upper
end, an enterprise network describes connections of wide area networks and LANs
connecting diverse business units within a geographically diverse business
organization.


To facilitate communication within larger networks, the networks are typically
partitioned into subnetworks, each sharing some common characteristic such as
geographical location or functional purpose, for example. The partitioning serves two
main purposes: to break the whole network down into manageable parts and to
logically (or physically) group users of the network. Network addressing schemes may
take such partitioning into account and thus an address may contain information about
how the network is partitioned and where the address fits into the network hierarchy.

For descriptive and implementation purposes, a network may be described as
having multiple layers with end devices attached to it, communicating with each other
using peer-to-peer protocols. The well-known Open Systems Interconnection (OSI)
Reference Model provides a generalized way to view a network using seven layers and
is a convenient reference for mapping the functionality of other models and actual
implementations. The distinctions between the layers in any given model are clear, but
the implementation of any given model or mapping of layers between different models
is not. For example, the standard promulgated by the Institute of Electrical and
Electronics Engineers (IEEE) in its 802 protocols defines standards for LANs and its
definitions overlap the bottom two layers of the OSI model.

In any such model, a given layer communicates either with the same layer of a
peer end station across the network, or with the same layer of a network element
within the network itself. A layer implements a set of functions that are usually
logically related and enable the operation of the layer above it.

The relevant layers for describing this invention include OSI Layers 1 through
4. Layer 1, the physical layer, provides functions to send and receive unstructured bit
patterns over a physical link. The physical layer concerns itself with such issues as the
size and shape of connectors, conversion of bits to electrical signals, and bit-level
synchronization. More than one type of physical layer may exist within a network.
Two common types of Layer 1 are found within IEEE Standard 802.3 and FDDI (Fiber
Distributed Data Interface).

Layer 2, the data link layer, provides support for framing, error detecting,
accessing the transport media, and addressing between end stations interconnected at or
below layer 2. The data link layer is typically designed to carry packets of information
across a single hop, i.e., from one end station to another within the same subnet, or
LAN.

Layer 3, the network layer, provides support for such functions as end-to-end
addressing, network topological information, routing, and packet fragmentation. This
layer may be configured to send packets along the best "route" from its source to its
final destination. An additional feature of this layer is the capability to relay
information about network congestion to the source or destination if conditions
warrant.

Layer 4, the transport layer, provides application programs such as an electronic
mail program with a "port address" which the application can use to interface with the
data link layer. A key difference between the transport layer and the lower layers is
that an application on a source end station can carry out a conversation with a similar
application on a destination end station anywhere in the network, whereas the lower
layers carry on conversations with end stations which are their immediate neighbors in
the network. Layer 4 protocols also support reliable connection-oriented services; an
example Layer 4 protocol providing such services is the Transmission Control Protocol
(TCP).

Different building blocks exist for implementing networks that operate at these
layers. End stations are the end points of a network and can function as sources,
destinations and network elements or any other intermediate point for forwarding data
received from a source to a destination.

At the simplest level are repeaters, which are physical layer relays that simply
forward bits at Layer 1.

Bridges represent the next level above repeaters and are data link layer entities
which forward packets within a single LAN using look-up tables. They do not modify
packets, but just forward packets based on a destination. Most bridges are learning
bridges. In these bridges, if the bridge has previously learned a source, it already
knows to which port to forward a packet addressed to that source. If the bridge has
not yet forwarded a packet from the destination, the bridge does not know the port
location of the destination, and forwards the packet to all unblocked output ports,
excluding the port of arrival. Other than acquiring a knowledge of which ports sources
are transmitting packets to, the bridge has no knowledge of the network topology. Many
LANs can be implemented using bridges only.
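
By way of illustration only, the learning and flooding behavior described above may be
sketched in software as follows. The class name, the method name, the use of short
strings as addresses, and the rule that a packet whose destination lies on the arrival
port is not forwarded are assumptions made for this sketch; they are not taken from the
specification.

```python
class LearningBridge:
    """Toy model of a learning bridge; ports are small integers (hypothetical)."""

    def __init__(self, ports, blocked=()):
        self.ports = set(ports)
        self.blocked = set(blocked)   # ports that must not be used for flooding
        self.table = {}               # source address -> port it was last seen on

    def handle(self, src, dst, in_port):
        """Learn the source, then return the set of ports to forward to."""
        self.table[src] = in_port     # learning step
        out = self.table.get(dst)
        if out is None:
            # Unknown destination: flood to all unblocked ports except arrival.
            return self.ports - self.blocked - {in_port}
        if out == in_port:
            return set()              # assumption: destination is on the arrival port
        return {out}


bridge = LearningBridge(ports=[1, 2, 3, 4], blocked=[4])
flooded = bridge.handle("aa", "bb", in_port=1)   # "bb" has not been learned yet
learned = bridge.handle("bb", "aa", in_port=2)   # "aa" was learned on port 1
```

In this sketch the first packet is flooded because its destination is unknown, while
the reply is sent to a single port because its destination was learned as a source.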

Routers are network layer entities which can forward packets between LANs.
They have the potential to use the best path that exists between sources and
destinations based on information exchanged with other routers that allows the routers
to have knowledge of the topology of the network. Factors contributing to the "best"
path might include cost, speed, traffic, and bandwidth, as well as others.

Brouters are routers which can also perform as bridges. For those layer 3
protocols of which the brouter knows, it uses its software to determine how to forward
the packet. For all other packets, the brouter acts as a bridge.
Switches are generalized network elements for forwarding packets wherein the 
composition of the switch and whether it implements layer 2 or layer 3 is not relevant. 

Typically, bridges forward packets in a flat network without any cooperation by
the end stations, because the LAN contains no topological hierarchy. If a LAN, for
example, is designed to support layer 3 functionality, then routers are used to
interconnect and forward packets within the LAN.

Bridges cannot use hierarchical routing addresses because they base their
forwarding decisions on media access control (MAC) addresses which contain no
topological significance. Typically, MAC addresses are assigned to a device at its time
of manufacture. The number of stations that can be interconnected through bridges is
limited because traffic isolation, bandwidth, fault detecting, and management aspects
become too difficult or burdensome as the number of end stations increases.

Learning bridges self-configure, allowing them to be "plug and play" entities
requiring virtually no human interaction for setup. Routers, however, require intensive
configuration, and may even require configuration activities at the end nodes. For
example, when a network utilizes the Transmission Control Protocol/Internet Protocol
(TCP/IP), each end node must manually receive its address and subnet mask from an
operator, and such information must be input to the router.

Generally, as the size and complexity of a network increases, the network
requires more functionality at the higher layers. For example, a relatively small LAN
can be implemented by using Layer 1 or Layer 2 elements such as repeaters or bridges,
while a very large network uses up to and including Layer 3 elements such as routers.

A single LAN is typically insufficient to meet the requirements of an
organization because of the inherent limitations on: (1) the number of end stations that
can be attached to a physical layer segment; (2) the physical layer segment size; and
(3) the amount of traffic, which is limited because the bandwidth of the segment must
be shared among all the connected end stations. In order to overcome these constraints,
other network building blocks are required.

As briefly described above, when the number of end stations in a network
increases, the network may be partitioned into subnetworks. A typical address in a
partitioned network includes two parts: a first part indicating the subnetwork; and a
second part indicating an address within the subnetwork. These types of addresses
convey topological information because the first part of the address defines
geographical or logical portions of the network and the second part defines an end
station within the subnetwork portion. Routing with hierarchical addressing involves two
steps: first, packets are routed to the destination's subnetwork; and second, packets are
forwarded to the destination within the subnetwork.
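
As a hedged illustration of the two-part address and the two routing steps described
above, the following sketch uses IPv4 addresses with a 16-bit subnetwork part. The
choice of IPv4, the prefix length, and every table entry, router name, and port name
are assumptions chosen for the example only.

```python
import ipaddress

# Hypothetical tables: the next hop toward each subnetwork, and the port on
# which each end station of the local subnetwork is attached.
SUBNET_ROUTES = {
    ipaddress.ip_network("10.1.0.0/16"): "router-A",
    ipaddress.ip_network("10.2.0.0/16"): "router-B",
}
LOCAL_SUBNET = ipaddress.ip_network("10.1.0.0/16")
LOCAL_STATIONS = {ipaddress.ip_address("10.1.0.7"): "port-3"}


def split(addr, prefix_len=16):
    """Return (subnetwork part, address within the subnetwork)."""
    iface = ipaddress.ip_interface(f"{addr}/{prefix_len}")
    host_part = int(iface.ip) - int(iface.network.network_address)
    return iface.network, host_part


def next_step(dst):
    """Step 1: route toward the subnetwork. Step 2: deliver within it."""
    subnet, _ = split(dst)
    if subnet != LOCAL_SUBNET:
        return SUBNET_ROUTES[subnet]
    return LOCAL_STATIONS[ipaddress.ip_address(dst)]
```

For example, `next_step("10.2.3.4")` consults only the subnetwork part of the address,
whereas `next_step("10.1.0.7")` falls through to the lookup within the local subnetwork.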

An end station receives a unique data link address (the MAC address) at the
time of manufacture, allowing the end station to attach to any LAN within a bridged
network without worrying about duplicate addresses. Data link addresses therefore
cannot convey any topological information. Bridges, unlike routers, forward packets
based on data link addresses and thus cannot interpret hierarchical addresses.

The current Internet is being forced to deal with increasing numbers of users
and increasing demands of multimedia applications. Future networks will be required
to support even higher bandwidth, larger numbers of users, and traffic classification
requirements by the network. Statistical studies show that the network domain as well
as the number of workstations connected to the network will grow at a faster rate in
the future. The trend is also to support multiple traffic types with varied
characteristics on the same physical link. This calls for more network bandwidth and
efficient usage of resources. To meet the bandwidth requirement, network speeds are
trending upward, reaching gigabit rates.

Network designers frequently use one particular combination of OSI Layer 2
and Layer 3 because of the success of the Internet and the increasing number of
products and networks using the Internet. Specifically, in a typical Internet-associated
network, designers combine an implementation in accordance with the IEEE 802
Standard (which overlaps OSI Layer 1 and Layer 2) with the Internet Protocol (IP)
network layer. This combination is also becoming popular within enterprise networks
such as intranets.

Supporting this combination by building networks out of layer 2 network
elements provides fast packet forwarding but has little flexibility in terms of traffic
isolation, redundant topologies, and end-to-end policies for queuing and administration
(access control). Building such networks out of layer 3 elements alone sacrifices
performance and is impractical from the hierarchical point of view because of the
overhead associated with having to parse the layer 3 header and modify the packet if
necessary. Furthermore, using solely layer 3 elements forces an addressing model with
one end station per subnet, and no layer 2 connectivity between the end stations.

Networks built out of a combination of layer 2 and layer 3 devices are used
today, but suffer from performance and flexibility shortcomings. Specifically, with
increasing variation in traffic distribution (the role of the "server" has multiplied with
browser-based applications), the need to traverse routers at high speed is crucial.

The choice between bridges and routers typically results in significant tradeoffs
(in functionality when using bridges, and in speed when using routers). Furthermore,
the service characteristics, such as priority, within a network are generally no longer
homogeneous, regardless of whether traffic patterns involve routers. In these networks,
differing traffic types exist and require different service characteristics such as
bandwidth and delay.

To meet the traffic requirements of applications, the bridging devices should
operate at line speeds, i.e., they operate at or faster than the speed at which packets
arrive at the device, but they also must be able to forward packets across
domains/subnetworks. Even though current hybrid bridge/router designs are able to
achieve correct network delivery functions, they are not able to meet today's
increasing speed requirements.

What is needed is a switch or network element that forwards both layer 2 and
layer 3 packets quickly and efficiently both within a subnetwork and across
subnetworks, and to other networks. Further, a network element is needed that can
forward layer 3 packets at wire-speed, i.e., as fast as packets enter the network
element. Additionally, a network element is needed that allows layer 2 forwarding
within a subnetwork to have the additional features available in layer 3 routing and to
provide certain quality of service for applications within the subnetwork, such as
priority and bandwidth reservation.

SUMMARY OF THE INVENTION
The present invention enables the above problems to be substantially overcome
by providing a system and method for a multi-layer network element for forwarding
received packets to one or more appropriate output ports.

An embodiment of the present invention includes a method of forwarding a
packet entering from an input port to one or more appropriate output ports based on
single searches of an associative memory for each layer. The associative memory
contains certain quality of service information that may be applied to any layer.

A packet is received on an input port, and from the packet a first search key is
created based on the header of the packet. An associative memory lookup is performed
for the first search key, which results in two potential forwarding decisions for the
packet. If the destination address found in the first search key matches an entry in
the associative memory, then the potential output port or ports are those associated
with the destination address as found in the associative memory. If the destination
address does not match any entry in the associative memory, then all ports except the
incoming port are candidates for the potential output port or ports.
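
The two outcomes of the first search may be sketched as below. A Python dictionary
stands in for the associative memory, and the function name, argument names, and
sample address are hypothetical; the sketch shows only the decision logic, not a
hardware implementation.

```python
def first_search(assoc_memory, dest_addr, in_port, all_ports):
    """Return the candidate output ports produced by the first search key.

    assoc_memory maps a destination address to the ports stored with it
    (a dictionary is used here as a stand-in for the associative memory).
    """
    entry = assoc_memory.get(dest_addr)
    if entry is not None:
        return set(entry)                  # match: ports stored with the entry
    return set(all_ports) - {in_port}      # no match: all ports but the incoming one


memory = {"00:11:22:33:44:55": [2]}        # illustrative entry only
hit = first_search(memory, "00:11:22:33:44:55", in_port=1, all_ports=range(1, 5))
miss = first_search(memory, "66:77:88:99:aa:bb", in_port=1, all_ports=range(1, 5))
```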

The packet is also categorized by class to aid in creating a second search key.
Packets of a particular class share common characteristics, such as what portions of the
header will be used to create the second search key. A class also defines certain default
forwarding information for packets within the class. The default information may
include certain quality of service information.

An associative memory lookup is performed using the second search key. The
results of this second search, the first search, and the default information are combined
to determine which of the potential output port or ports as proffered by the three
sources is the most appropriate for this packet. The packet is then forwarded to the
appropriate output port or ports.

In some instances, the second search key yields no match in the associative
memory. In these cases, the default information is combined with the results of the first
search. Furthermore, the results of the first search may override any of the other
forwarding information; and the results of the second search may force the results of
the first search to be used to forward the packet.

In one embodiment, the invention implements forwarding of layer 2 and layer 3 
packets. In this embodiment, the first search key includes information about layer 2 
destination addresses and the second search key and default information include 
information about layer 3 and possibly layer 4. 

Such an implementation, in one embodiment, allows certain quality of service
to be applied to layer 2 forwarding in the following manner. When a packet enters the
network element as a layer 2 packet, the first search key will result in layer 2
forwarding information being output from the associative memory. The class of the
packet will be determined and the packet provided with default class information that
may include certain quality of service information. The second search key, however,
may not yield any results from the associative memory because an entry in the memory
has not yet been created by the central processing unit. In this instance, merge logic
will use the layer 2 forwarding result but also use the quality of service information
from the default forwarding information. Such a feature allows the network element to
be configured to provide quality of service to layer 2 traffic within a subnetwork.
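
The merge behavior described above, for the case in which the second search yields no
match, can be sketched as follows. The `Result` type and the `merge` function are
hypothetical names. The branch taken when the second search does match is an assumed
precedence rule included only to make the sketch complete; the text above specifies
only the no-match case.

```python
from dataclasses import dataclass
from typing import Optional, Set


@dataclass
class Result:
    ports: Optional[Set[int]] = None   # None means no port information
    qos: Optional[str] = None          # None means no quality of service information


def merge(first: Result, second: Optional[Result], default: Result) -> Result:
    """Combine the first search, the second search, and the class default."""
    if second is None:
        # Second search missed: layer 2 ports plus the class default QoS.
        return Result(ports=first.ports, qos=default.qos)
    # Assumption for illustration: a second-search match takes precedence
    # wherever it supplies information.
    ports = second.ports if second.ports is not None else first.ports
    qos = second.qos if second.qos is not None else default.qos
    return Result(ports=ports, qos=qos)


layer2 = Result(ports={3})
class_default = Result(qos="high-priority")
merged = merge(layer2, None, class_default)
```

Here `merged` carries the layer 2 output port together with the default quality of
service, which is the combination the paragraph above attributes to the merge logic.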

Still other embodiments of the present invention will become readily apparent
to those skilled in the art from the following detailed description, wherein is shown and
described only the embodiments of the invention by way of illustration of the best
modes contemplated for carrying out the invention. As will be realized, the invention
is capable of other and different embodiments and several of its details are capable of
modification in various obvious respects, all without departing from the spirit and scope
of the present invention. Accordingly, the drawings and detailed description are to be
regarded as illustrative in nature and not as restrictive.

BRIEF DESCRIPTION OF THE DRAWINGS

Fig. 1 illustrates a system incorporating a multi-layer network element
according to the invention.

Fig. 2 illustrates the multi-layer network element of Fig. 1.

Fig. 3 illustrates the switching element of the multi-layer network element in
more detail.

Fig. 4 illustrates the forwarding logic of the switching element in more detail.

Fig. 5 illustrates the class logic of Fig. 4 in more detail.

Fig. 6 illustrates the process used in determining which information dictates a
packet's path through the multi-layer network element.

Fig. 7 illustrates the information dependency in determining how to forward a
packet out of the network element.



DETAILED DESCRIPTION 
Fig. 1 illustrates a system incorporating a multi-layer network element
according to the present invention. The system includes the multi-layer network
element, various networks, end stations, routers, and bridges. By way of example and
as broadly embodied and described herein, a system 10 incorporating a multi-layer
network element 12 according to the present invention includes networks 14 and 16,
end stations 18, router 24, bridge 26, and local area networks (LAN) 28.



The bridge 26 connects some of the LANs 28 and end stations 18 to the
network 14 and to each other. The bridge 26 may be a conventional learning bridge.
The bridge 26 keeps track of the addresses of the end stations 18 that transmit a packet
showing up on one of ports 30 to the bridge 26. The end stations 18 may be any
device capable of sending or receiving packets of information. Typically, the end
stations 18 are personal computers, workstations, printers, servers, and/or any other
device that can be connected to a network.

The bridge 26 initially does not know on which of its ports packet destinations
are located, and must flood an incoming packet to all ports in order to properly
forward the packet. Once the bridge 26 receives a packet destined for an address it
already recognizes, the bridge 26 knows what port the destination is on so that it does
not have to flood the packet on all outgoing ports. Eventually, the bridge 26 has
learned enough addresses to all but eliminate the flooding needed on the
ports. Of course, any time an end station 18 changes ports on the bridge 26, the bridge
26 must relearn the end station 18's port.

The bridge 26 typically does not modify the packet, contains no information
about the topology of the network 14, and examines few parts of the packet header.
The bridge 26 operates quickly because it makes no modifications to the packet and is
only concerned with learning sources and forwarding to destinations. Typically, bridges
26 use look-up tables to search for sources and destinations.

The router 24 connects the network 14 to the networks 16. Only one router 24
is illustrated by way of example, but there may be many routers connecting other
networks or end stations 18. The router 24 provides the communication necessary
between the network 14 and the networks 16 and may be a conventional router. Such
routers include layer 3 functionality for forwarding packets to an appropriate
destination including route calculation, packet fragmentation, and congestion control.
Routers of this type are described, for example, in Interconnections: Bridges and
Routers by Radia Perlman, published by Addison-Wesley. The router 24 must have
knowledge of the topology of the network in order to determine the best route for
packets. The router 24's knowledge of the network is gained through topological
information passed between multiple such routers 24 connected to the network 14.

Software running on the router 24 parses an incoming packet to determine
various characteristics about the packet, including the type of the protocol being used
and the source and destination(s). Other determinations based on examining the packet
may be necessary, such as quality of service (QoS) factors including priority
and bandwidth reservation. The router 24 then uses the extracted information and
computes the next destination for the packet based on topology and route information
that is stored in the memory of the router 24. The router 24 also applies QoS rules and
actions.

The router 24's process for calculating the next destination may require many
accesses to memory and computation of the route from that information. Furthermore,
the packet is typically received and stored while any processing is taking place. After
the router 24 has determined what actions are necessary on the packet, any
modifications are made to the packet as stored in the memory or on the way out of the
router 24. The routers 24 are typically required to replace the layer 2 source and
destination of the packet, update any checksums of the packet, and handle any issues
related to packet lifetime.
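
To make the per-packet modifications listed above concrete, the following sketch
assumes an untagged Ethernet II frame carrying an IPv4 packet, where packet lifetime is
the time-to-live (TTL) field and the checksum is the IPv4 header checksum. The
specification does not name these protocols at this point, so the frame layout and the
function names are assumptions for illustration.

```python
import struct


def ipv4_checksum(header: bytes) -> int:
    """One's-complement checksum over 16-bit words of an IPv4 header.

    The checksum field must be zero when computing a new value; over a
    header that already holds a valid checksum the result is 0.
    """
    if len(header) % 2:
        header += b"\x00"
    total = sum(struct.unpack(f"!{len(header) // 2}H", header))
    while total >> 16:                       # fold carries back into 16 bits
        total = (total & 0xFFFF) + (total >> 16)
    return ~total & 0xFFFF


def route_rewrite(frame: bytes, new_dst_mac: bytes, new_src_mac: bytes):
    """Return the rewritten frame, or None if the packet lifetime has expired."""
    ip = bytearray(frame[14:])               # assumes a 14-byte Ethernet II header
    header_len = (ip[0] & 0x0F) * 4
    if ip[8] <= 1:                           # TTL exhausted: do not forward
        return None
    ip[8] -= 1                               # handle packet lifetime
    ip[10:12] = b"\x00\x00"                  # clear the checksum, then recompute it
    ip[10:12] = struct.pack("!H", ipv4_checksum(bytes(ip[:header_len])))
    return new_dst_mac + new_src_mac + frame[12:14] + bytes(ip)


# Sample 20-byte IPv4 header with its checksum field set to zero.
sample_header = bytes.fromhex("450000730000400040110000c0a80001c0a800c7")
```

Even in this reduced form, each forwarded packet requires a read, a decrement, and a
checksum recomputation in addition to the address replacement, which is the kind of
work the text identifies as costly when done in software.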

To carry out the functions that the conventional router 24 performs, the
software examines memory locations, makes modifications to the packet, and calculates
new values for some fields. Such actions provide increased functionality beyond simple
packet forwarding like that found in bridges 26, such as determining the best route for
the packet and providing QoS features; however, in conventional routers 24 such actions
take up valuable time.

The network 14 provides communication paths for all of the elements
connected to it. In the example of Fig. 1, the elements include the multi-layer network
element 12, router 24, and bridge 26. Any number of elements could be connected to
the network 14 in a multitude of ways. Fig. 1 illustrates only one possible
combination. The elements connected to the network 14 do not require the network 14
to be of any particular size or configuration. For the end stations 18 and the bridge 26,
a detailed topological knowledge of the network 14 is not required.

The multi-layer network element 12 according to the present invention connects
various elements to the network 14 and to each other. As illustrated by way of
example, the multi-layer network element 12 connects a LAN 28, the end stations 18,
and the network 14. The multi-layer network element 12 combines the functions of
both a bridge and a router. Functioning as a router, the multi-layer network element 12
contains topological information about network 14 to intelligently route a packet to its
destination while providing associated layer 3 functionality typically found in a router
24. Functioning as a bridge, the multi-layer network element 12 learns source/port
combinations to forward layer 2 packets. The multi-layer network element 12 differs
from conventional bridge/router combinations in that certain layer 3 processing
operates as quickly as layer 2 switching found in the bridge 26.

Fig. 2 illustrates the multi-layer network element 12 of Fig. 1 in more detail.
The multi-layer network element 12 according to one embodiment of the invention
includes a processor 32, a processor memory 34, a switching element 36, a plurality of
network element ports 38, a forwarding memory 40, an associated memory 42, and
packet buffer memory 44. The end stations 18, the LAN 28, and the network 14 are
connected to the multi-layer network element 12 using a plurality of network element
ports 38. Other multi-layer network elements 12 may also be connected to the
multi-layer network element 12.

The switching element 36 is also connected to the processor 32, the forwarding
memory 40, the associated memory 42, and the packet buffer memory 44. The
processor 32 is also connected to the processor memory 34. The forwarding memory 40
and the associated memory 42 are connected to each other as well as to the switching
element 36.

The switching element 36 performs most of the packet forwarding functions 
using both layer 2 and layer 3 information, and possibly also some layer 4 information, 
stored in forwarding memory 40 and associated memory 42, without having to rely on 
the processor 32 to calculate routes or determine appropriate actions on every packet. 




The processor 32 performs tasks that the switching element 36 is not equipped
to handle. For example, when new layer 3 routes must be calculated, the processor 32
uses processor memory 34, which contains detailed information about the topology of
any networks reachable from the multi-layer network element 12. The processor 32
makes its computations primarily using software programming units in conjunction
with accesses to the memory 34. The switching element 36 makes its decisions
primarily in hardware, using the forwarding memory 40 and the associated memory 42.
The forwarding memory 40 and the associated memory 42 contain only a portion of
the information contained in the memory 34, and are configured for quick access and
retrieval.

Fig. 3 illustrates a detailed view of the switching element 36 and its
connections to the processor 32, the plurality of network element ports 38a-n, the
forwarding memory 40, the associated memory 42, and the packet buffer memory 44.
The switching element 36 includes input ports 50a-n, a forwarding logic 52, a packet
memory manager 54, and output ports 56a-n. Each input port 50i and output port 56i
corresponds to a network element port 38i. Each of the input ports 50 also connects to
both the forwarding logic 52 and the packet memory manager 54.

For a given i, an input port 50i receives packets from its respective multi-layer
network element port 38i and tests the packets for correctness. If the packet is
ill-formed, it is discarded. Packets passing this initial screening are temporarily buffered
by the input port 50i. Once the input port 50i has buffered at least the first 64 bytes of
the received packet, the input port 50i passes the header to the forwarding logic 52.

The forwarding logic 52 is connected to the processor 32, the forwarding
memory 40, and the associated memory 42. The forwarding logic 52 performs several
functions. It initially screens the packet to determine whether the packet is
encapsulated, for example by Subnetwork Access Protocol (SNAP), or whether the
packet is tagged, for example by a virtual LAN (VLAN) identifier. If the packet is
either of those two types, the forwarding logic 52 uses offset information to locate
the appropriate layer header information needed for further processing.

The forwarding logic 52 also searches the forwarding memory 40 for matches
at layer 2 and/or layer 3. The search may also include some information at layer 4. In
the preferred embodiment, the forwarding memory 40 is a content-addressable memory
(CAM) storing information about both layer 2 and layer 3 switching, and may contain
some layer 4 information. If a match is found, data stored in associated memory 42
and pointed to by the matching entry in the forwarding memory 40 serves to define the
actions that the switching element 36 must take to forward the packet to the appropriate
destination(s).

In another embodiment, the forwarding memory 40 could be implemented using
a sequentially addressed random access memory. In this embodiment, a hashing function
would be performed on the particular key. The resulting hashed value would be an
address into the memory 42 associated with the pre-hashed key.
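
The hashed alternative can be illustrated with a minimal Python sketch. The text
does not name a hash function, a table size, or a collision policy, so the use of
CRC-32, the 1024-slot table, and the chaining of colliding keys below are all
assumptions made for illustration only.

```python
import zlib

TABLE_SIZE = 1024  # assumed number of slots; not specified in the text


def slot_for(key: bytes) -> int:
    """Hash a search key to an address in the associated memory."""
    return zlib.crc32(key) % TABLE_SIZE


class HashedForwardingTable:
    """RAM-style table reached through a hash front end (hypothetical model)."""

    def __init__(self):
        # Each slot holds (key, result) pairs so that colliding keys can coexist.
        self.slots = [[] for _ in range(TABLE_SIZE)]

    def insert(self, key: bytes, result: dict) -> None:
        bucket = self.slots[slot_for(key)]
        for i, (stored_key, _) in enumerate(bucket):
            if stored_key == key:
                bucket[i] = (key, result)  # overwrite an existing entry
                return
        bucket.append((key, result))

    def lookup(self, key: bytes):
        """Return the stored result, or None when the key is not present."""
        for stored_key, result in self.slots[slot_for(key)]:
            if stored_key == key:
                return result
        return None
```

Unlike a CAM, which compares the key against all entries at once, this arrangement
must store the original key next to the result so that a collision is not mistaken
for a match.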

In still another embodiment, the forwarding memory 40 and the associated
memory 42 could be contained in a single random access memory. In one
implementation of that single random access memory, the entries could be accessed
sequentially, requiring a hash front end. Another implementation of that single random
access memory could be a CAM.

The packet memory manager 54 is connected to the packet buffer memory 44,
the input port 50i, and the output port 56i. As indicated above, each output port 56i
corresponds to one of the plurality of multi-layer network element ports 38i. While
illustrated as separate units, the input port 50i and output port 56i corresponding to a
particular multi-layer network element port 38i are tightly coupled since information
flows both ways through the network element ports 38.

After the forwarding logic 52 has determined what to do with the packet, it
passes that information to the input port 50i. If the input port 50i does not filter the
packet, then it requests pointers to free memory locations in the packet buffer memory
44 from the packet memory manager 54. The packet memory manager 54 responds by
providing location addresses of free memory space in the packet buffer memory 44.
The input port 50i then requests a write access from the packet memory manager 54
and sends the pointer and the data to the packet memory manager 54.

In some instances, the input port 50i must make modifications to the packet as
instructed by the forwarding logic 52. The input port 50i makes these
modifications prior to the packet being stored in the packet buffer memory 44. When
requested by the input port 50i, the packet memory manager 54 places the packet into
the appropriate address location specified by the input port 50i. The input port 50i then
passes information about where the packet is stored to the appropriate output ports 56
as determined from the information received at the input port 50i from the forwarding
logic 52.
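
The exchange between an input port and the packet memory manager described above
can be sketched as follows. This is a simplified software model, not the hardware
of the embodiment: the class and function names are hypothetical, one buffer cell
holds one whole packet, and reference counting for packets queued on several
output ports is left out.

```python
class PacketMemoryManager:
    """Toy model of a buffer manager that hands out free cell addresses."""

    def __init__(self, num_cells: int):
        self.cells = [None] * num_cells
        self.free = list(range(num_cells))

    def request_pointer(self):
        """Return the address of a free cell, or None if the buffer is full."""
        return self.free.pop() if self.free else None

    def write(self, pointer: int, packet: bytes) -> None:
        self.cells[pointer] = packet

    def read(self, pointer: int) -> bytes:
        return self.cells[pointer]

    def release(self, pointer: int) -> None:
        """Return a cell to the free list once it is no longer needed."""
        self.cells[pointer] = None
        self.free.append(pointer)


def store_and_notify(manager, packet, output_queues, output_ports):
    """Input-port side: store the packet once, then tell each output port where it is."""
    pointer = manager.request_pointer()
    if pointer is None:
        return None  # no buffer space; the drop policy is outside this sketch
    manager.write(pointer, packet)
    for port in output_ports:
        output_queues[port].append(pointer)
    return pointer
```

Passing only the pointer to the output ports means the packet body is written to
the buffer once, regardless of how many output ports will transmit it.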

In a preferred embodiment, the appropriate output ports may include no output
ports or one or more output ports. The output port 56i requests and receives packets
from the packet memory manager 54, and transmits the packet to its associated network
element port 38i when the conditions for transmission are met. In some instances, the
output port 56i must place its MAC address as the source address on outgoing packets.
If this situation is dictated by the results from the forwarding logic 52 as passed to
the input port 50i, the input port 50i places such an indication in the packet buffer
memory 44. The output port 56i detects this indication and replaces the address as the
packet leaves the output port 56i. Thus, only minor modifications to the packets are
necessary on the output side of the switching element 36.

According to the above embodiment, when the forwarding memory 40 contains
matching entries for layer 2 switching or layer 3 routing, the multi-layer network
element 12 will operate at wire-speed. Wire-speed is defined as the
maximum packet rate at which a given layer 1 and layer 2 combination can transport
packets. If an element connected to a network can process packets as fast as they enter
the element or faster, then the element operates at wire-speed.
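
As a rough illustration of what wire-speed demands, the following sketch computes
the maximum frame rate of an Ethernet link and the time available per forwarding
decision. The text does not state link speeds or port counts, so those are
parameters here. The 8-byte preamble and 12-byte interframe gap are standard
Ethernet framing overheads, and the 64-byte default is the minimum Ethernet frame
size. The second function assumes a single forwarding engine shared by all ports.

```python
PREAMBLE_BYTES = 8         # preamble plus start-of-frame delimiter
INTERFRAME_GAP_BYTES = 12  # minimum idle time between frames, in byte times


def max_packets_per_second(link_bits_per_second: float, frame_bytes: int = 64) -> float:
    """Maximum frame rate on an Ethernet link for a given frame size."""
    bits_on_wire = (frame_bytes + PREAMBLE_BYTES + INTERFRAME_GAP_BYTES) * 8
    return link_bits_per_second / bits_on_wire


def per_packet_budget_ns(link_bits_per_second: float, num_ports: int,
                         frame_bytes: int = 64) -> float:
    """Time per forwarding decision when every port receives at the maximum rate."""
    total_rate = max_packets_per_second(link_bits_per_second, frame_bytes) * num_ports
    return 1e9 / total_rate
```

For a 100 Mb/s link carrying 64-byte frames this gives about 148,810 frames per
second, or 6.72 microseconds per frame on one port. With eight such ports sharing
one forwarding engine, the budget falls to 840 nanoseconds per decision.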

In a preferred embodiment, the network element 12 processes packets for a
worst-case scenario of a steady stream of 64-byte packets entering all input ports 50
simultaneously. If the layer 3 information is not contained in the forwarding memory
40, the packet is forwarded using layer 2 information and then processed according to
conventional layer 3 processing by software in the processor 32.

Unlike conventional layer 3 processing, the processor 32 may update the
forwarding memory 40 by placing new layer 3 entries as they are learned and created.
Any packets matching the new entries are forwarded at wire-speed, i.e., forwarding
decisions are made for a packet before the next packet arrives.

While this invention is described using layer 2 and a
combination of layers 3 and 4, one skilled in the art would recognize that searching on
and creating entries in the forwarding memory 40 for any portion of a packet or its
header, or any combination thereof, readily flows from the description. Thus, this
invention is not limited to any specific implementation of layers according to the ISO
standard.

Fig. 4 illustrates the forwarding logic 52 in more detail. The forwarding logic
52 includes class logic 60, layer 2 (L2) logic 62, layer 3 (L3) logic 64, and merge
logic 66. The input port 50i connects to the class logic 60, the L2 logic 62, the L3
logic 64, and the merge logic 66. Only one input port 50i is shown for simplification,
though all input ports 50 are connected in a similar manner. Preferably, the forwarding
logic 52 is not duplicated for each input port 50i. Instead, all input ports 50 arbitrate
for access to the forwarding logic 52.

The L2 logic 62 is connected to the forwarding memory 40 and is responsible
for creating a key to match against the entries stored in the forwarding memory 40 for
layer 2 forwarding decisions. Depending on the configuration of the forwarding
memory 40, the key may be applied against all or some of the entries of the
forwarding memory 40.

During operation, the input port 50i receives a packet from the multi-layer
network element port 38i and sends the header plus the input port 50i identifier to the
forwarding logic 52. The forwarding logic 52 first searches the forwarding memory 40
to determine whether the forwarding memory 40 contains an entry for the layer 2
source transmitting the packet. A matching entry will exist if the multi-layer network
element 12 has previously received a packet from the same layer 2 source and has
learned which port it is connected to. If no matching entry exists, the forwarding logic
52 performs a learn function by placing an entry in the forwarding memory 40
including the source address. The forwarding logic 52 signals the processor 32 that it
has learned a new source address. In some instances, the layer 2 source will exist in
the forwarding memory 40, but will be associated with a different input port 50i than
the input port 50i of the incoming packet. In this instance, no matching entry will exist
in the forwarding memory 40 because a match depends on both the layer 2 source and
the input port 50i.

The forwarding logic 52 also searches the forwarding memory 40 for an entry
indicating the port of the destination address. If no match is found, then the forwarding
logic 52 instructs the input port 50i to flood the packet to all of the active output ports
56.
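
The layer 2 source and destination searches can be summarized in a short Python
sketch. A dictionary stands in for the forwarding memory 40, and a list of events
stands in for the signal to the processor 32. All names are hypothetical, and the
exclusion of the arrival port when flooding is conventional bridge behaviour
assumed here rather than stated in the text.

```python
class L2Table:
    """Toy layer 2 table; a dict stands in for the forwarding memory."""

    def __init__(self, active_ports):
        self.active_ports = set(active_ports)
        self.port_of = {}        # MAC address -> port on which it was last seen
        self.learn_events = []   # stands in for the signal to the processor

    def source_search(self, src_mac: str, in_port: int) -> bool:
        """Return True on a match; a match requires both address and port."""
        if self.port_of.get(src_mac) == in_port:
            return True
        self.port_of[src_mac] = in_port              # learn or relearn
        self.learn_events.append((src_mac, in_port))
        return False

    def destination_search(self, dst_mac: str, in_port: int) -> set:
        """Return the set of output ports for the packet."""
        if dst_mac in self.port_of:
            return {self.port_of[dst_mac]}
        # Unknown destination: flood to the other active ports.
        return self.active_ports - {in_port}
```

A source address seen on a new port fails the source search in the same way as an
unknown address, so a station that moves is relearned on its new port.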

For the layer 2 information described above in the preferred embodiment, the
forwarding memory 40 contains the values of the MAC addresses of the sources and a
pointer to a corresponding entry in the associated memory 42. The forwarding memory
40 may also contain additional layer 2 information such as a VLAN identifier if tagged
packets are being used. The associated memory 42 contains more information about its
corresponding entry in the forwarding memory 40. Layer 2 information in the
forwarding memory 40 is preferably limited to the least amount of information
necessary to make a layer 2 search. In a layer 2 search, this information is preferably
just the MAC address and the input port 50i, but the CAM may also contain any
information relating to tagged addressing.

In a preferred embodiment, the forwarding memory 40 allows multiple matches
for a layer 2 search. The processor 32 ensures that the order of the entries is such that
if an address/port combination exists in the forwarding memory, that entry is selected.
If the particular source/port combination is not found, then a match may occur
including VLAN information so that any layer 2 destination search will at least match
a known VLAN or an unknown VLAN entry, each of which defines the output ports 56
for flooding in its respective entry.

The L3 logic 64 is connected to the forwarding memory 40 and is responsible
for creating a key to match against the entries stored in the forwarding memory 40 for
layer 3 forwarding decisions. As with the L2 search key, the L3 key may be applied
against all or some of the entries of the forwarding memory 40.

To create the key, the L3 logic 64 uses information from the input port 50i
including the packet header and an input port 50i identifier, and information from the
class logic 60. The merge logic 66 is connected to the class logic 60, the associated
memory 42, the packet memory manager 54, and the processor 32. The merge logic
66 uses information from the class logic 60 and information output from the associated
memory 42 to instruct the input port 50i what to do to properly forward the packet to
its appropriate destination(s). In some instances, there is no appropriate destination and
the packet is discarded. In other instances, the merge logic 66 will signal the processor
32 that it must perform some task in response to the received packet.

Layer 3 switching, while more complex, is similar to layer 2 switching. The
forwarding logic 52 searches the forwarding memory 40 for a matching entry to a
layer 3 search key created by the L3 logic 64. If a match exists, the information in the
associated memory 42 is used by the merge logic 66 to instruct the input port 50i what
to do with the packet. If the search provides no match, the switching element 36
forwards the packet as a bridge and may pass all or portions of the packet to the
processor 32 for further processing. The L3 logic 64 creates the search key using
information from the packet header, the input port 50i, and the class logic 60.

The class logic 60, illustrated in more detail in Fig. 5, examines information in the
packet header to determine any encapsulation information and to determine a class for
the layer 3 information. The class logic 60 includes the encapsulation logic
68 and the class action logic 70. Each input port 50i is connected to both the
encapsulation logic 68 and the class action logic 70. The class action logic 70 is
connected to the encapsulation logic 68, the L3 logic 64, and the merge logic 66.

The encapsulation logic 68 is responsible for examining the packet header and
determining any offsets into the header for the layer 3 and layer 4 information, if
needed. The encapsulation logic 68 includes class filters 72 to determine any offsets
into the packet to identify locations of relevant information. In a preferred embodiment,
one filter 72 recognizes an implementation in accordance with the IEEE 802.3
Standard Ethernet header, another filter 72 recognizes an implementation in
accordance with the IEEE Standard 802.1q Tagged Ethernet header, and still another
recognizes an LLC SNAP encapsulation. Other encapsulations would become readily
apparent to one skilled in the art and could be implemented with additional
encapsulation filters 72. The encapsulation logic 68 passes encapsulation offsets to the
class action logic 70 so that the class action logic 70 knows from where in the packet
to draw the appropriate field information.
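
How such encapsulation filters might yield an offset can be sketched in Python.
The function below is an illustration under stated assumptions, not the filter
hardware of the embodiment. The function name and the encapsulation labels are
hypothetical; the field positions follow the standard Ethernet, 802.1Q, and
LLC/SNAP frame layouts.

```python
import struct

ETHERTYPE_VLAN = 0x8100     # IEEE 802.1Q tag protocol identifier
LLC_SNAP = b"\xaa\xaa\x03"  # DSAP, SSAP, and control bytes for LLC/SNAP


def layer3_offset(frame: bytes):
    """Return (encapsulation label, protocol type, byte offset of the layer 3 header)."""
    offset = 12  # skip the destination and source MAC addresses
    tagged = False
    (type_or_len,) = struct.unpack_from("!H", frame, offset)
    if type_or_len == ETHERTYPE_VLAN:
        tagged = True
        offset += 4  # skip the 4-byte VLAN tag
        (type_or_len,) = struct.unpack_from("!H", frame, offset)
    offset += 2
    if type_or_len >= 0x0600:
        # Values of 0x0600 and above are protocol types rather than lengths.
        name = "ethernet-ii"
        ethertype = type_or_len
    elif frame[offset:offset + 3] == LLC_SNAP:
        name = "llc-snap"
        # 3 bytes of LLC, 3 bytes of OUI, then the 2-byte protocol type.
        (ethertype,) = struct.unpack_from("!H", frame, offset + 6)
        offset += 8
    else:
        return ("unknown", None, None)
    return ("tagged-" + name if tagged else name, ethertype, offset)
```

The returned offset plays the role of the encapsulation offset passed to the class
action logic 70: an untagged frame places the layer 3 header at byte 14, a tagged
frame at byte 18, and an untagged LLC/SNAP frame at byte 22.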

The class action logic 70 determines to which class a packet belongs. A class is
used by both the L2 and L3 logics to aid in searching and to add to the functionality
of the multi-layer network element 12. The L2 logic 62 applies a single class to all
layer 2 searches. Layer 3, on the other hand, has a plurality of programmable classes.

The classes help to define a class type and, for each class, which bytes from the
packet header should be used in creating the layer 3 search key by the L3 logic
64, its priority, and a default class result that defines what should happen if no layer 3
match occurs in the forwarding memory 40.

In a preferred embodiment, there are four possible outcomes when no match
occurs. First, the header may be sent to the processor 32. This is contemplated when
the possibility of identifying a layer 3 flow exists. Second, the entire packet could be
copied to the processor 32. This is contemplated when initially setting a unicast route
or to provide firewall protection by initially examining certain routes or flows or when
it is unknown where in the packet required information may exist to create search
keys. Third, the layer 2 results may be used for forwarding. Fourth, the packet may be
discarded. Other actions may be possible depending on the configuration of the network
or the particular protocol in use, as would become readily apparent to one skilled in
the art.

Some of the criteria that the classes take into account may be whether the class
is considered address dependent or address independent. Adding a class identifier
allows the switching element 36 to respond to varying network situations and greatly
simplifies organizing and storing information in the forwarding memory 40.

Representative examples of address independent classes that could be identified
by the class logic 60 include: Address Resolution Protocol (ARP); Internet Group
Management Protocol (IGMP); Reverse ARP (RARP); Group Address Registration
Protocol (GARP); Protocol Independent Multicast (PIM); and Resource Reservation
Protocol (RSVP). Representative examples of address dependent classes include: TCP
flow; non-fragmented UDP flow; fragmented UDP flow; hardware routable IP; and IP
version 6. Of course, other protocols could be similarly recognized.

The class logic 60 produces an unambiguous class result for every incoming 
packet. For an unrecognized protocol, the class logic 60 will still produce a class 
result, but that class result signifies an unrecognized protocol and what actions should 
take place on a packet of this type of class. 
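
The relationship between a class, its key fields, and its default result can be
sketched as a small table. The class names, the header fields chosen for each key,
the priorities, and the default assigned to each class below are assumptions for
illustration. The text specifies only that each class defines which header bytes
form the key, a priority, and one of the four default outcomes.

```python
from dataclasses import dataclass
from enum import Enum


class DefaultAction(Enum):
    HEADER_TO_PROCESSOR = 1
    PACKET_TO_PROCESSOR = 2
    USE_LAYER2_RESULT = 3
    DISCARD = 4


@dataclass(frozen=True)
class PacketClass:
    name: str
    address_dependent: bool
    key_fields: tuple          # header fields used to build the layer 3 key
    priority: int
    default: DefaultAction     # applied when the layer 3 search finds no match


# Illustrative class table; the field choices and defaults are assumptions.
CLASSES = {
    "tcp": PacketClass("tcp", True, ("ip_src", "ip_dst", "src_port", "dst_port"), 1,
                       DefaultAction.HEADER_TO_PROCESSOR),
    "udp_fragment": PacketClass("udp_fragment", True, ("ip_src", "ip_dst"), 1,
                                DefaultAction.USE_LAYER2_RESULT),
    "arp": PacketClass("arp", False, (), 0, DefaultAction.PACKET_TO_PROCESSOR),
}
UNRECOGNIZED = PacketClass("unrecognized", False, (), 0, DefaultAction.USE_LAYER2_RESULT)


def classify(protocol: str) -> PacketClass:
    """Always return a class; unknown protocols map to UNRECOGNIZED."""
    return CLASSES.get(protocol, UNRECOGNIZED)


def build_l3_key(packet_class: PacketClass, header: dict) -> tuple:
    """Combine the class name with the header fields that the class selects."""
    return (packet_class.name,) + tuple(header[f] for f in packet_class.key_fields)
```

Because `classify` never fails, every packet receives a class result, which
mirrors the requirement that the class logic 60 produce an unambiguous result even
for an unrecognized protocol.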

Generally, layer 3 flows are address dependent and will contain information
beyond just a simple class of traffic. In those instances where additional information
has been placed by the processor 32 into the forwarding memory 40, there may be
more than one entry for a particular class in the forwarding memory 40. The processor
32 ensures that of the entries matched, the one used is the most appropriate one.
Different classes may have different criteria for what is the most appropriate match
depending on the type of packets embodied within a particular class. The flexibility
allowed by having multiple matching entries in the forwarding memory 40 is further
enhanced by ensuring that the best match is provided for a particular flow and, because
of this feature, different actions will be possible for packets within the same type of
class.

In the preferred embodiment, the processor 32 reorders the layer 3 entries when
it places any new layer 3 entry so that the best match for a particular search criterion
occurs earliest in the memory. Those skilled in the art will recognize many different
implementations to achieve the same result. In one preferred embodiment, the
processor 32 ensures that the entry with the longest potential matching key within a
particular class is at the top, or earliest, location in the memory. However, the
processor 32 may also place an entry above the longest matching entry so that for a
particular traffic pattern the most important match may be one that matches many
keys. For example, an entry that matches, for a particular class, based on an
application port such as "http" and no other information, will take precedence over
entries that might match more than just the layer 4 application. Another example might
be forcing a match on a particular source within a class type. This might occur when
the operator might want to provide packets coming from a particular server with a high
priority regardless of the destination or layer 4 application.

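The ordering policy can be sketched as a first-match table that the control
software keeps sorted. The sketch is an assumption-laden model, not the CAM of the
embodiment. The `pinned` flag is a hypothetical stand-in for an entry the
processor deliberately places above the longest match, and specificity is measured
simply as the number of non-wildcard fields.

```python
WILDCARD = None  # a field set to WILDCARD matches any value


class OrderedEntries:
    """First-match table kept in an order chosen by the control software."""

    def __init__(self):
        self.entries = []  # list of (pattern dict, result, pinned flag)

    @staticmethod
    def _specificity(pattern: dict) -> int:
        return sum(1 for v in pattern.values() if v is not WILDCARD)

    def add(self, pattern: dict, result: str, pinned: bool = False) -> None:
        self.entries.append((pattern, result, pinned))
        # Pinned entries first, then the most specific (longest key) first.
        self.entries.sort(key=lambda e: (not e[2], -self._specificity(e[0])))

    def search(self, key: dict):
        """Return the result of the earliest matching entry, or None."""
        for pattern, result, _ in self.entries:
            if all(v is WILDCARD or key.get(f) == v for f, v in pattern.items()):
                return result
        return None
```

Because the search returns the earliest match, all of the policy lives in the
ordering: a pinned entry that matches only an application port wins over a more
specific entry for the same traffic.
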
In a preferred embodiment, the merge logic 66 directs the input port 50i to take
one of the following actions on a packet: filter the packet; forward the packet at layer
2; forward the packet as a layer 3 flow; process the packet as a layer 3 route; or
forward the packet as a multicast route. Packets that the merge logic 66 instructs the
input port 50i to filter are those that include certain header information determined to
be unsupported. Examples of classes whose packets would be forwarded at layer 2
would include a fragmented UDP flow and a class indicating that the header
information is unknown. A fragmented UDP flow operates using layer 2 information
because after the first packet, the fragmented packets do not include all relevant
information from the layer 4 header, the UDP ports for example. Layer 2
forwarding would be optional for address independent classes depending on the
particular class.

The merge logic 66 instructs the input port 50i to use layer 3 flow information
for TCP or non-fragmented UDP flows. Flows are those packets forwarded within the
subnet to which the multi-layer network element 12 is attached and require no header
modification on forwarding. Routes, on the other hand, are packets coming from
sources outside the subnet or destined to addresses beyond the subnet such that the
header information must be modified prior to forwarding by the multi-layer network
element 12. In a preferred embodiment, instructions to forward the packet as a layer 3
route come from the merge logic 66 when the class indicates that the packet is of a
class hardware routable IP. In other words, the destination of the incoming packet is
recognized by the class logic 60 of the multi-layer network element 12, and the
multi-layer network element 12 must then forward the packet to the next hop destination,
which is determined by routing protocols. Those skilled in the art can easily recognize
from the invention other situations where such a type of result would be desired.

One feature of the invention is the ability to bridge flows, that is, use the
forwarding memory to quickly forward layer 2 packets using layer 3 functionality
through the network element 12. Certain flows are particularly suited for this type of
activity and include static flows, self-detecting flows, and flows set up by reservation
protocols, such as RSVP. Static flows are those set up in advance by the network
element 12 operator and define layer 3 functionality for selected layer 2 network traffic
and are not subject to aging. Self-detecting flows are a function of the type of
application.

Initially, these flows are bridged with no layer 3 functionality because no layer
3 entry matches. The packet header is sent to the processor 32 for examination. The
processor 32 analyzes the packet and, based on programmed heuristics, determines
whether and how to create a layer 3 entry in the forwarding memory 40 for the packet
type. For example, a "ping" packet would not warrant a layer 3 flow entry because it
is, at best, a transient packet.

Protocols like RSVP work to reserve certain service features of the network and
signal that a number of packets will follow this same path. In this case, it serves the
application using the reservation protocol to forward at layer 2, but add layer 3, or
more, functionality like priority to ensure the required class of service through the
multi-layer network element 12.

Fig. 6 illustrates preferred results produced by the merge logic 66 using
information from the class logic 60 and the associated memory 42. Three results are
presently preferred: (1) use the layer 2 forwarding results; (2) use the layer 3
forwarding results; and (3) use the layer 3 results while using the layer 2 topology. In
some instances, there may be an identified class, but no matching entry in the
forwarding memory 40. In this instance, the default actions for the class are used.
Note that the use of layer 3 default results can be considered a subset of using layer 3
forwarding results.

Default results may be set for packets of a class type to provide protection such
as that provided by firewall technology. In a firewall application, the multi-layer
network element 12 would be programmed to direct any packet of a defined class to
the processor 32 for subsequent processing.

Referring to Fig. 6, if the class logic 60 determines that the packet is of an
unrecognized class (step 112), then the packet is acted on using the layer 2 results
(step 114). If the packet's class is recognized (step 112) and the associated memory 42
or class logic 60 indicates that a layer 2 result should be forced (step 116), then the
layer 2 results are used (step 118) regardless of any other information.

If no layer 2 results are forced as a result of the layer 2 search (step 116) and
there is a match of the layer 3 key (step 120), then the layer 3 information is checked
to determine whether the layer 3 information forces a layer 2 port decision (step 122).
If the layer 3 information forces a layer 2 forwarding result, then the output port is
determined by the results of the layer 2 search; however, any other information found
in the results of the layer 3 search, such as QoS factors, is applied (step 124). If the
layer 3 results do not call for forcing a layer 2 forwarding result, then the layer 3
results are passed on to the input port 50i (step 126). If there is no layer 3 match in
step 120, then the default actions for the class generated by the class logic 60 are
passed to the input port 50i (step 128). It is also contemplated that a packet is sent to
the processor 32 without being forwarded to any output port 56 by the input port 50i
when using the L3 class default action.

Thus, if the class is recognized and the layer 3 search matches an entry, then
the actions defined by the layer 3 search govern the instructions to the input port 50i,
even though that might mean that the layer 2 output port results are used. If not, the
packet is treated using layer 2 results and the packet or the packet's header might be
sent to the processor 32 for subsequent processing of the layer 3 information, if
desired.

If the information coming out of associated memory 42 for a layer 3 match
indicates a force layer 2 result, then packet forwarding is done using the layer 2
results, but any information relating to quality of service may still be implemented on
a layer 2 forwarding decision. In this way, the multi-layer network element 12 may
add functionality above and beyond normal layer 2 bridges by allowing
quality of service factors to be applied to layer 2 bridging or routing within the same
subnet or VLAN.
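
The decision sequence of Fig. 6, as described in the text, can be written as a
single function. The sketch below follows the order of the tests in steps 112
through 128. The dictionary layout of the results, the `force_l2_ports` flag, and
the `source` label added to each result are hypothetical conveniences, not part
of the described hardware.

```python
def merge(class_recognized, force_l2, l2_result, l3_result, class_default):
    """Pick the forwarding result following the order of tests in Fig. 6.

    l2_result and l3_result are dicts such as {"ports": {...}, "qos": ...};
    l3_result is None when the layer 3 search found no match.
    """
    if not class_recognized:                       # step 112 -> step 114
        return dict(l2_result, source="l2")
    if force_l2:                                   # step 116 -> step 118
        return dict(l2_result, source="l2-forced")
    if l3_result is None:                          # step 120 -> step 128
        return dict(class_default, source="class-default")
    if l3_result.get("force_l2_ports"):            # step 122 -> step 124
        # Output ports come from layer 2; everything else, such as QoS, from layer 3.
        return dict(l3_result, ports=l2_result["ports"], source="l3-on-l2-topology")
    return dict(l3_result, source="l3")            # step 126
```

The fourth branch is the case that adds quality of service to bridged traffic: the
packet leaves on the port chosen by the layer 2 search but carries the priority
found by the layer 3 search.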

Accordingly, the input port 50i presents to the forwarding logic 52 the header
of the received packet and its port designation. The output of the forwarding logic 52
is a function of the header information and the arrival port and indicates whether the
input port 50i should store the packet in the packet buffer memory 44 in cooperation
with the packet memory manager 54; whether any priorities should be associated with
the packet on a particular output port 56i; and whether the input port 50i should make
any modifications to the packet such as header replacement prior to passing the packet
to the packet buffer memory 44. Thus, an output port 56i need not make any
modifications to the header except for inserting its MAC address and computing a new
packet checksum when routing unicast or multicast packets, for example.

The layer 2 and layer 3 information in the forwarding memory 40 are
independent of each other as applied to searches, although some information contained
in a layer 2 entry may be duplicated in a layer 3 entry. Additionally, a layer 3 entry
may also contain some layer 4 information such as the UDP or TCP ports. Those
skilled in the art would readily recognize other features that could be added by
including other information from other header layers or the packet body, and such are
considered to be within the scope of this invention. After both the layer 2 and layer 3
searches are completed, the merge logic 66 determines what actions the input port 50i
should take on the packet.

Any layer 2 learning of source addresses, or changes that might occur as a
result of a topology change, are communicated to the processor 32 as part of the layer
2 source search. As mentioned earlier, the layer 2 information may include tagged
information like that used to support virtual LAN (VLAN) information. When and if
used, the VLAN information helps to restrict layer 2 flooding to only those ports
associated with a particular VLAN or specific tagging.

Each entry in the associated memory 42 may contain information relating to the
following outcomes. The entry includes an indication of the output ports 56 for the
packet including whether all or portions of the packet should be sent to the processor
32. The entry allows for more than one port 56i to be specified, if needed, to support,
for example, multicast addressing. The entry also includes a priority for
the packet which maps into the number of output queues which may be present on an
output port 56. The entry also includes an indicator for which output ports 56 should
use Best Effort in transmitting the packet. Best Effort implies that no guarantee on the
packet's transmission or QoS is provided. Those skilled in the art will easily recognize
that the invention applies equally well to other QoS.

The entry may also indicate whether a new tag should be applied to an
outgoing packet when, for example, routing between VLANs requires an
outgoing tag different from the incoming tag, and what that tag should be, if necessary.

The entry also contains information relating to source and destination aging.
Source aging information indicates whether the source is active or not. In a preferred
implementation, this information is updated by the forwarding logic 52 every time the
layer 2 source address is matched. The information is implemented in accordance with
IEEE standard 802.1d type address aging. Destination aging in the network element 12
indicates which layer 2 and layer 3 entries are active. The information for an entry is
updated every time an entry is matched, either by a layer 2 destination search or a
layer 3 match cycle for the entry.

The entry also provides for whether layer 2 results should be used for
forwarding by the input port 50i. As mentioned above, the layer 2 information may be
forced for a layer 3 entry, but in addition to the layer 2 forwarding information, layer 3
functionality may be added to the layer 2 forwarding.

The entry may also define a static entry. A static entry is not subject to layer 2 
learning and is never aged. 

Entries for layer 3 may include additional information. The entry may indicate
that only the first 64 bytes of the packet should be sent to the processor 32 for
subsequent processing. The entry may indicate whether the packet is part of a multicast
routing. If so, then the input port 50i should decrement the header checksum, forward
the packet to the indicated output ports 56, and indicate that the output port 56i needs
to replace the layer 2 source address of the packet with the output port 56i's MAC
address. Other types of header modifications will be readily apparent to those skilled
in the art to implement proper routing.

The entry in the associated memory 42 may also include the next hop
destination address to be used to replace the incoming destination in unicast routing. In
a unicast route, the incoming packet would have had its destination address as the
multi-layer network element 12.
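
The contents of an associated memory entry listed above can be collected into one
record for illustration. The field names below are hypothetical and the text does
not give an actual bit layout. The `age_out` function shows one possible aging
pass that removes idle entries and clears the activity flags. It is an assumed
mark-and-sweep scheme, not the 802.1d procedure itself.

```python
from dataclasses import dataclass, field
from typing import Optional


@dataclass
class AssociatedEntry:
    output_ports: set = field(default_factory=set)   # may name several ports (multicast)
    copy_to_processor: bool = False                   # send all or part of the packet to the CPU
    first_64_bytes_only: bool = False                 # layer 3: send only the start of the packet
    priority: int = 0                                 # maps onto an output queue
    best_effort_ports: set = field(default_factory=set)
    new_vlan_tag: Optional[int] = None                # outgoing tag, if it must change
    source_active: bool = False                       # source aging flag
    destination_active: bool = False                  # destination aging flag
    force_layer2: bool = False                        # use layer 2 results for forwarding
    static: bool = False                              # not subject to learning or aging
    multicast_route: bool = False
    next_hop_mac: Optional[str] = None                # unicast routing: replacement destination


def age_out(entries: dict) -> list:
    """One aging pass: drop idle non-static entries, then clear the activity flags."""
    removed = [k for k, e in entries.items()
               if not e.static and not (e.source_active or e.destination_active)]
    for k in removed:
        del entries[k]
    for e in entries.values():
        e.source_active = e.destination_active = False
    return removed
```

An entry survives a pass only if it was matched since the previous pass or is
marked static, which corresponds to the statement that a static entry is never
aged.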

The merge logic 66 must wait for the results of the searches of the forwarding
memory 40 done by the L2 logic 62 and the L3 logic 64. In the preferred embodiment,
the layer 2 and layer 3 information are stored in the same forwarding memory 40;
however, they could be stored in separate memories. As stated earlier, the preferred
embodiment has the forwarding memory 40 limited to storing the information used by
the L2 and L3 logics that match the fields of the key to reduce the size of the
forwarding memory. As such, the associated memory 42 stores additional information
about the entries. Each entry in the forwarding memory 40 points to a corresponding
entry in the associated memory 42, the contents of which the associated memory 42
provides to the merge logic 66 to make its forwarding decisions.

Fig. 7 illustrates the steps occurring in the forwarding logic 52. While Fig.
7 illustrates the preferred embodiment of the operation of the forwarding logic 52,
those skilled in the art will immediately recognize other equivalent ways to accomplish
the same task. Information is received at the forwarding logic 52 from the input port
50 (step 200). On one path, the L2 logic 62 determines the necessary information for a
layer 2 search and carries out that search against the forwarding memory 40 (step 202).
The L2 logic 62 and forwarding memory 40 determine whether there was a
matching entry for the source of the packet (step 204). If the source address is not in
the forwarding memory 40, then the source address is learned (step 206). To learn the
source address, the L2 logic 62 and the forwarding memory 40 ensure that an entry is
placed in the forwarding memory. A signal is sent to the processor 32 to examine the
new information.

If the source address was already in the forwarding memory 40 and matched to
the input port 50 of arrival, then the L2 logic 62 attempts to match the destination
address to the forwarding memory 40 (step 208). If the source address was not in the
forwarding memory 40 or the source address was in the memory but at a different
port, then the source address and port combination is learned in step 206 prior to
attempting a destination search in step 208.

In the other path from step 200, the class logic 60 determines the class in step
210. After the class logic 60 has determined the class and passed this on to the L3 logic
64, the L3 logic attempts a match against the forwarding memory for the layer 3 entry
(step 212).

In step 214, the merge logic 66 uses information from the L2 search of step 
208, if there was one, the class logic results from step 210, and the layer 3 search
results from step 212 to make the appropriate forwarding decisions based on the 
criteria of Fig. 6. Once the merge logic 66 has determined the appropriate forwarding 
decision in step 214, the results are passed to the output port 50i (step 216). 
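
The flow of steps 200 through 216 can be summarized in a short Python sketch. It is a software illustration only, written under several assumptions: the tables are plain dictionaries, the packet is a dictionary with hypothetical field names (src_mac, dst_mac, ethertype), and the merge rule at the end is a placeholder (layer 3 result first, then layer 2, then the class default) that stands in for the criteria of Fig. 6, which are not reproduced here.

```python
def forward(packet, in_port, l2_table, l3_table, classes, notify_processor):
    """Return the set of output ports for one packet (illustrative only)."""
    # Steps 202-206: search on the source address; learn it if it is
    # unknown or was last seen on a different port, then signal the processor.
    src = packet["src_mac"]
    if l2_table.get(src) != in_port:
        l2_table[src] = in_port
        notify_processor(src, in_port)

    # Step 208: layer 2 destination search.
    l2_port = l2_table.get(packet["dst_mac"])

    # Step 210: classify, which also selects the fields of the layer 3 key.
    cls = classes.get(packet["ethertype"])

    # Step 212: layer 3 search, possible only once the class is known.
    l3_ports = None
    if cls is not None:
        key = (cls["name"],) + tuple(packet[f] for f in cls["key_fields"])
        l3_ports = l3_table.get(key)

    # Step 214: placeholder merge rule (an assumption, not Fig. 6).
    if l3_ports is not None:
        return set(l3_ports)
    if l2_port is not None:
        return {l2_port}
    return set(cls["default_ports"]) if cls is not None else set()
```

The sketch runs the two paths one after the other for readability; nothing in it depends on that order except that the layer 3 key needs the class and the merge needs both search results.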

Fig. 7 illustrates the flow proceeding down two paths. Because the layer 2 and
layer 3 searches are independent, everything but the actual memory search may be
pipelined or accomplished in parallel. In a preferred implementation, the processing by
the class logic 60, the L2 logic 62, and L3 logic 64 may proceed in a parallel or
pipelined fashion except where dependencies prevent such action. For example, the L3
logic 64 requires the output of the class logic 60 to create the search key for the layer
3 search and the merge logic 66 requires that the layer 2 and layer 3 searches be
finished to merge the results according to Fig. 6.

In another embodiment, however, the L2 information and the L3 information 
may be in separate memories. In this case the L2 and L3 searches may occur 
simultaneously. 

After the merge logic 66 determines the actions on the packet, the input port
50i makes write requests to the packet manager 54 if the packet is not to be filtered or
dropped. The packet need not be received in its entirety before the input port 50i
makes write requests to the packet manager 54. The input port 50i passes to the packet
manager 54 the address where the incoming portion of the packet is to be stored, the
number of output ports 56 on which the packet will be output, the priority of the packet, and
[...] then delivers the pointers to the appropriate output port(s) 56. The input port 50i
receives pointers to free memory locations in the packet buffer memory 44 where the
packet may be placed. Preferably, the input port 50i has obtained a pointer from the
packet buffer manager 54 prior to making write requests.

The output port 56i stores the pointers in output queues for packet transmission.
When a queue presents a pointer for transmission, the output port 56i requests the
contents stored at the pointer address from the packet manager 54 and transmits the
contents out of the multi-layer network element 12 on the corresponding network
element port 38. The packet manager 54 keeps track of whether all of the output ports
56 using a particular pointer have transmitted the contents associated with that pointer;
if so, the memory space is freed for future use.
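
The bookkeeping performed by the packet manager 54 resembles reference counting, and a minimal Python sketch may make that concrete. The class and method names (PacketManager, allocate, write, read) are hypothetical, the buffer is modeled as a dictionary, and the sketch assumes each output port reads a given pointer exactly once.

```python
class PacketManager:
    """Tracks buffer pointers and frees each one after its last output port reads it."""

    def __init__(self, num_buffers):
        self.free = list(range(num_buffers))  # pointers to free buffer locations
        self.contents = {}                    # pointer -> stored packet data
        self.pending = {}                     # pointer -> output ports still to transmit

    def allocate(self):
        # The input port obtains a pointer before making write requests.
        # Raises IndexError when no free location remains.
        return self.free.pop()

    def write(self, pointer, data, num_output_ports):
        # The input port supplies the data and the number of output ports.
        self.contents[pointer] = data
        self.pending[pointer] = num_output_ports

    def read(self, pointer):
        # Called once by each output port when its queue presents the pointer.
        data = self.contents[pointer]
        self.pending[pointer] -= 1
        if self.pending[pointer] == 0:
            # Every output port has transmitted: free the space for reuse.
            del self.contents[pointer]
            del self.pending[pointer]
            self.free.append(pointer)
        return data
```

Storing a multicast packet once and handing the same pointer to several output queues avoids copying; the counter is what makes it safe to reclaim the space only after the slowest port has finished.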

In the preferred embodiment, the switching element 36 and all of its 
constituents are implemented in hardware. Also, in the preferred embodiment, the 
forwarding memory 40 and associated memory 42 are implemented in hardware.

In an alternate preferred embodiment, the switching element 36 and all its
constituents are implemented in hardware on an application specific integrated circuit.
Equally contemplated, an integrated circuit could contain a hardware implementation of
the switching element 36 and any combination, or portion thereof, of the processor 32, the
processor memory 34, the forwarding memory 40, the associated memory 42, and the
packet buffer memory 44.

A multi-layer network element has been described that combines the features of
quick layer 2 bridge-type forwarding with the added functionality of
layer 3 routing and QoS support to create an apparatus, and a method of its use, that
performs both layer 2 and most layer 3 forwarding decisions prior to the receipt of the
next packet.

The foregoing description of the preferred embodiments of the multi-layer
network element has been presented for purposes of illustration and description. It is
not intended to be exhaustive or to limit the invention to the precise form disclosed,
and modifications and variations are possible in light of the above teachings or may be
acquired from practice of the invention as disclosed. The embodiments were chosen
and described in order to explain the principles of the invention and its practical
application to enable one skilled in the art to utilize the invention in various
embodiments and with various modifications as are suited to the particular use
contemplated. It is intended that the scope of the invention be defined by the claims
appended hereto, and their equivalents.

CLAIMS

What is claimed is: 

1. A method for making a forwarding decision for a packet entering a
network element having at least one input port and at least one output port, wherein
the packet enters the network element on an input port and exits the network element
on appropriate output ports, if any, including the steps of:

(1) receiving a first header portion of the packet;

(2) determining a first search key from the first header portion;

(3) causing a memory to output first forwarding information
associated with the first search key;

(4) receiving a second header portion of the packet;

(5) determining a class information for the packet based on the
second header portion, wherein each class information includes a class, second header
key information indicating which fields of the second header portion should be used to
create a second search key, and default forwarding information for packets falling
within the class;

(6) creating the second search key from the second header portion
based on the second header portion key information;

(7) causing the memory to output second forwarding information, if
any, associated with the second search key;

(8) determining the appropriate output ports, if any, based on the
first forwarding information, the second forwarding information, and the default
forwarding information.



WO 99/00948 PCT/US98/13016 

2. The method of claim 1, wherein:

step (3) includes determining, as a function of the first destination, a
first address, whose corresponding contents in the memory store the first forwarding
information; and

step (7) includes determining, as a function of the search key, a second
address, whose corresponding contents in the memory store the second forwarding
information.



3. The method of claim 2, wherein determining the first address includes
searching a content-addressable memory with the first destination address to produce
the first address; and

determining the second address includes searching the content-addressable
memory with the search key to produce the second address.

4. The method of claim 2, further including the step of providing the
memory as a first memory for storing first forwarding information and a second
memory for storing second forwarding information.

5. The method of claim 3, further including the step of providing the
content-addressable memory as a first content-addressable memory that stores the first
address and a second content-addressable memory that stores the second address.



6. The method of claim 2, wherein determining the first address includes
using a hashing function on the first destination to produce the first address; and

determining the second address includes using a hashing function on
the search key to produce the second address.

7. The method of claim 1, wherein step (8) is carried out using only the
first forwarding information when the first forwarding information so indicates.

8. The method of claim 1, wherein step (8) is carried out using only the
first forwarding information when the second forwarding information so indicates.



9. The method of claim 1, wherein step (8) is carried out, when the second
search key fails to output second forwarding information, using only the first forwarding
information and default forwarding information.

10. The method of claim 9, wherein step (8) is carried out using only the
first forwarding information when the default forwarding information so indicates and
the second search key fails to output second forwarding information.

11. The method of claim 1, wherein step (8) is carried out using only the
second forwarding information.

12. The method of claim 1, wherein step (8) is carried out using a
combination of the first forwarding information and second forwarding information.

13. The method of claim 9, wherein the second forwarding information 
includes quality of service information. 



14. The method of claim 13, wherein the quality of service information
includes a priority for the packet.

15. An apparatus for making a forwarding decision for a packet having a
header, the packet being provided as input to a network element having at least one
input port and at least one output port, wherein the packet enters the network element
on an input port and exits the network element on one or more appropriate output
ports, if any, comprising:

class logic configured to output class information for the packet based
on the header, including a class, key information which identifies portions of the header,
and default forwarding information for packets falling within the class;

search logic configured to output, based on the header, a first search
key, and, based on the header, the class, and the key information, a second search key;

a memory configured to output a first forwarding result in response to
the first search key, and to output a second forwarding result, if any, in response to the
second search key;

merge logic configured to output information about appropriate output
ports in response to the default forwarding information, the first forwarding result, and
the second forwarding result; and

forwarding logic configured to direct the packet from the input port to
the appropriate output ports, if any, based on the information about the appropriate
output ports.

16. The apparatus of claim 15, wherein the memory includes:

interface logic configured to output, as a function of the first search key,
a first address, and to output, as a function of the second search key, a second
address; and

wherein the memory is configured to output the first forwarding result
in response to the first address and output the second forwarding result in response to
the second address.

17. The apparatus of claim 15, wherein the memory includes:

a content-addressable memory configured to output a first forwarding
information address when accessed using the first search key, and a second forwarding
information address when accessed using the second search key; and

a forwarding memory configured to output the first forwarding result
when accessed using the first forwarding information address, and to output the second
forwarding result when accessed using the second forwarding information address.

18. The apparatus of claim 17, wherein the interface logic is a
content-addressable memory.

19. The apparatus of claim 18, wherein the content-addressable memory includes:

a first content-addressable memory configured to output the first
forwarding information address when accessed using the first search key; and

a second content-addressable memory configured to output the second
forwarding information address when accessed using the second search key.

20. The apparatus of claim 17, wherein the default forwarding information
and the second forwarding result contain quality of service information.

21. The apparatus of claim 17, wherein the merge logic is configured to
output information about appropriate output ports, when the memory fails to output the
second forwarding result, using only the default forwarding information and the first
forwarding result.

22. The apparatus of claim 21, wherein the default forwarding information
contains quality of service information.

23. The apparatus of claim 15, wherein the class logic includes at least one 
encapsulation filter that outputs pointers to locations in the header based on 
encapsulation information about the packet.
